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ABSTRACT 


The  purpose  of  this  paper  is  to  demonstrate  that,  when  the  number  of  targets  is  not  known 
beforehand,  Bayesian  optimal  filtering  approaches  to  multisensor-multitarget  data  fusion  problems 
encounter  xmexpected  conceptual  and  practical  difficulties.  The  reason  is  that  single-target 
Bayesian  filtering  cannot  be  naively  generalized  to  multitarget  situations  and  that,  consequently, 
serious  pitfalls  await  those  who  simply  "declare  victory. "  In  particular,  we  show  that  the  classical 
Bayesian  techniques  for  optimally  determining  parameters  of  interest— e.g.,  the  maximum  a 
posteriori  (MAP)  and  expected  a  posterior  (EAP)  estimators— cannot  even  be  defined  in  multitarget 
situations.  We  describe  our  solution  to  this  problem,  "finite-set  statistics"  (FISST),  as  well  as 
"joint  multitarget  probabilities  (IMP),"  a  renaming  of  a  special  case  of  FISST.  We  show  how 
FISST  leads  to  provably  Bayes-optimal  multisensor-multitarget  data  fusion  algorithms.  We  discuss 
the  optimality  and  convergence  properties  of  two  different  Bayesian  data  fusion  algorithms. 

1  Introduction 

Suppose  that  a  single  sensor  observes  a  single  moving  target  and  that: 

(1)  x„  is  the  state  of  the  target  at  time-step  a; 

(2)  z„  is  the  measurement  of  the  target  collected  by  a  single  sensor  at  time-step  a; 

(3)  2“  =  /zj  ,...,z^  is  the  set  of  accumulated  evidence  at  time-step  a; 

(4)  f(z  I  x)  is  the  sensor  likelihood  function; 

(5)  /a+i  I  1  xj  is  the  Markov  motion  model  for  the  target;  and 

(6)  fct  I  I  is  the  posterior  density  of  the  target  state  at  time-step  a,  given  all  accumulated  data  2* 

Then  the  following  well-known  Bayesian  recursive  nonlinear  filtering  equations  are  the  basis  for  recursive  Bayesian 

optimal  single-target  tracking  [1,2, 7, 9  p.  174,41]: 
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Given  the  posterior,  the  current  target  state  can  be  approximated  using  any  of  the  classical  Bayes-optimal 
estimators,  e.g.  the  maximum  a  posteriori  (MAP)  or  expected  a  posteriori  (EAP)  estimators. 

1.1  "Naive"  Multitarget  Recursive  Bayesian  Nonlinear  Filtering,  Ideally,  one  would  like  to  directly  generalize 
this  standard  and  well-understood  approach  to  multisensor,  multitarget  problems.  That  is,  suppose  that 

(r)  =  /x,.^  is  the  set  of  states  of  the  unknown  number  n(a)  of  targets  at  time-step  a; 

(2')  is  the  set  of  observations  collected  by  all  sensors  at  or  around  time-step  a; 

(3')  :  Z;  is  the  accumulated  evidence  at  time-step  a; 

(4')  f(Z  I  X)  is  the  likelihood  function  for  the  entire  suite  of  sensors  collecting  on  all  targets; 

(5')  /«+/ 1  I  X^J  is  the  multitarget  Markov  motion  model  for  the  entire  multitarget  system;  and 

(6')  /a  I  a(^a  I  the  multitaTget  posterior  density  of  the  multitarget  state  X^  at  time-step  a,  given  all 

evidence  Z^“^  accumulated  up  until  that  time. 

As  viewed  from  a  ’’naive"  perspective,  it  would  seem  that  all  that  is  needed  is  to  write  down  the  following 
multitarget  versions  of  the  recursive  Bayesian  nonlinear  filtering  equations: 


Likewise,  given  the  multitarget  posterior  distribution  /„  |  |  we  would  need  only  apply  one  of  the  classical 

Bayes-optimal  estimators  to  arrive  at  an  estimate  of  the  current  multitarget  state  X^  .  We  would  then  be  able  to 
estimate,  in  a  simultaneous  and  optimal  manner,  the  number  h  of  targets  as  well  as  the  individual  states  Xj  of 
the  targets,  without  any  need  to  determine  an  optimal  report-to-track  association.  Stated  in  different  words:  Such 
a  procedure  would  optimally  resolve  the  conflicting  objectives  of  detection  and  tracking  (as  well  as  identification, 
if  target  I.D.  state  and  measurement  variables  are  present). 

Unfortunately,  when  one  tries  to  apply  the  standard  statistical  thinking  just  described  to  the  multitarget  case  with 
unknown  number  of  targets,  one  quickly  discovers  that  taking  things  for  granted  leads  to  serious  troubles  that 
directly  bear  on  practice.  Specifically,  Bayes-optimal  multitarget  estimation  and  filtering  encounters  fundamental 
conceptual  and  practical  difficulties  when  the  number  of  targets  is  unknown.  What  is  at  issue  is  neither  theoretical 
hair-splitting  nor  mere  mathematical  "bookkeeping."  Rather,  what  is  actually  at  stake  is  our  ability  to  do  Bayes- 
optimal  multitarget  filtering  and  estimation  at  all  and,  moreover,  our  ability  to  even  know  what  ""Bayes-optimar 
means  in  such  a  context. 

This  paper  is  the  third  in  a  series  devoted  to  explaining  these  difficulties;  the  first  two  were  "Multisource, 
multitarget  filtering:  A  unified  approach"  [29]  and  "Multitarget  Markov  Motion  Models"  [30].  In  these  papers  and 
in  the  hook  Mathematics  of  Data  Fusion  [6],  we  described  a  theoretically  rigorous,  systematic,  and  unified  resolution 
of  these  and  other  difficulties  based  on  a  special  case  of  random  set  theory  [3,8,36]  called  "finite-set  statistics 
(FISST)  [6,8,21-26,28-35,40].  We  also  explained  why  FISST  (or  something  of  equal  rigor)  is  necessary  to  deal 


with  the  associated  problems.  In  the  second  paper  we  elaborated  on  item  (5')  above:  The  definition  and 
construction  of  computationally  tractable  Markov  motion  models  capable  of  effectively  dealing  with  the  dynamicism 
of  real-world  problems.  The  present  paper  has  two  purposes:  First,  to  summarize  and  explain  the  major  difficulties 
by  contrasting  true  multitarget  recursive  Bayesian  nonlinear  filtering  with  three  approaches  that  fail  to  address  these 
difficulties.  Second,  to  elaborate  on  item  (6'):  extracting  the  information  contained  in  the  multitarget  posterior  so 
that  the  associated  filter  will  rapidly  and  stably  converge  to  the  correct  answer. 

1.2  A  Short  History  of  Multitarget  Recursive  Bayesian  Nonlinear  Filering.  The  concept  of  multitarget  recursive 
Bayesian  nonlinear  filtering  is  a  relatively  new  one  in  the  data  fusion  engineering  community.  The  problems 
associated  with  it  are  greatly  alleviated  (but  not  completely  eliminated)  if  we  assume  that  the  number  of  targets, 
n,  is  known.  The  history  of  this  "known-n”  aspect  of  the  problem  is  summarized  in  Table  I. 

TABLE  I:  Multitarget  Recursive  Bayesian  Nonlinear  Filtering:  No.  of  Targets  Assumed  Known 


Date 

Author(s) 

Theoretical  Basis 

1987 

Washburn  [45] 

Point  Processes 

1991 

Xie  and  Evans  [46] 

Hidden  Markov  models 

1992 

Kamen  et.  al.  [10-12,38] 

Symmetric  Measurement  Equations  * 

Multitarget  measurement  model 
transformed  into  nonlinear 
measurement  model 

1993 

Kastella  [14] 

"Event-Averaged  MLE" 

Multitarget  EKF  using 
saddle-point  approximation 

1995 

Krishnamurthy  and  Evans  [18] 

Hidden  Markov  models 

To  our  knowledge  the  earliest  work  is  due  to  Washburn,  who  used  point  process  theory  to  model  both  multitarget 
observations  and  multitarget  states  and  then  constructed  a  multitarget  likelihood  function  defined  in  terms  of  these 
point  processes.  (Note:  Point  process  theory  is  closely  related  to  random  set  theory,  see  [3],)  Kamen  and  his 
associates  have  attacked  the  problem  by  constructing  a  multitarget  measurement  model  for  a  Gaussian  sensor. 
Specifically,  the  single-target  measurement  models  Zj  =  C!xi  +  Wi  are  transformed  into  a 

conventional  (but  nonlinear)  sensor  model  Z  -  f(K)  +  V  where  X  =  (Xi  ,...,xj  and  Z  is  a  vector  whose 
components  are  symmetric  functions  of  the  measurements.  In  Kastella’s  approach,  a  heuristic  multitarget  average 
likelihood  /eamleC^  I  is  defined  for  a  Gaussian  sensor  in  Poisson  clutter  and  incorporated  into  a  conventional 
Extended  Kalman  Filter  (see  Section  4  below). 

Table  II  summarizes  the  history  of  multitarget  Bayesian  recursive  nonlinear  filtering  problem  when  the  number  n 
of  targets  is  not  known  and  must  be  determined  along  with  the  individual  target  states.  As  already  noted,  in  the 
unknown-/!  case  multitarget  recursive  Bayesian  nonlinear  filtering  runs  afoul  of  serious  conceptual,  theoretical,  and 
practical  difficulties  [29,30].  Only  some  of  the  researchers  listed  in  Table  II  have  addressed  (or  even  been  aware 
of)  these  problems.  The  earliest  work  appears  to  be  due  to  Miller,  O’Sullivan,  Srivistava,  et.  al.  at  Washington 
University  in  St.  Louis.  Their  very  sophisticated  approach  requires  solution  of  stochastic  diffusion  equations  on 
non-Euclidean  manifolds.  It  is  also  unique  in  that  it  is  apparently  the  only  approach  to  deal  with  continuous 
evolution  of  the  multitarget  state.  (All  other  approaches  listed  in  Tables  I  and  II  assume  discrete  state-evolution.) 
Mahler  was  apparently  the  first  to  deal  with  the  discrete  state-evolution  case  in  complete  generality  (Bethel  and  Paras 
assume  discrete  observation  and  state  variables).  Kastella’s  "joint  multitarget  probabilities"  are  just  a  renaming  of 
a  special  case  of  Mahler’s  approach  (see  Section  6  below). 


TABLE  n:  Multitarget  Recursive  Bayesian  Nonlinear  Filtering:  No.  of  Targets  Unknown 


Date 

Author(s) 

Theoretical  Basis 

1991 

Miller  el.  al.  [19,37,42] 

"Jump  Diffusion " 

Approx,  solution  of  multitarget 
stochastic  PDEs 

1994 

Bethel  and  Paras  [5] 

Discrete  Bayesian  filtering 

1994 

Mahler  [22,23,34] 
"Finite-Set  Statistics" 

Random  set  theory 

1996 

Barlow  et.  al.  [4] 

"Unified  Data  Fusion " 

Heuristic 

1996 

Mahler-Kastella  [13,16,40] 

"Joint  Multitarget  Probabilities " 

"Finite-set  statistics"  under 
a  new  name  and  notation  [24,29,30,40] 

1997 

Portenko  et.  al.  [39] 

Point  processes 

1.3  Summary  of  the  Paper.  The  paper  is  organized  as  follows.  In  Section  2  we  begin  with  a  brief  review  of  the 
basic  elements  of  recursive  Bayesian  nonlinear  filtering  and  estimation.  In  Section  3  we  show  that,  when  the 
number  n  of  targets  is  unknown,  even  the  most  basic  of  issues-precise  definition  of  the  concept  of  a  multitarget 
state~“leads  to  trouble.  In  Section  4  we  clarify  the  key  issues  underlying  a  special  case—when  the  number  n  of 
targets  is  assumed  known—hy  contrasting  it  with  the  EAMLE  filter.  In  the  sections  that  follow,  we  focus  exclusively 
on  the  key  issues  associated  with  the  unknown-n  problem.  In  Section  5  we  contrast  the  filtering  aspects  of  the 
unknown-/i  problem  with  an  ad  hoc  scheme  called  "generalized  EAMLE."  In  Section  6  we  contrast  the  estimation 
aspects  of  the  unknown-n  problem  with  "JMP"“Which,  as  just  noted,  is  a  renaming  of  a  truncated  version  of  FISST. 
Section  7  describes  the  specific  problems  associated  with  stable  convergence  of  two  such  estimators  to  the  true 
solution.  Conclusions  may  be  found  in  Section  8. 

2.  Bayesian  Single-Sensor,  Single-Target  Filtering  and  Estimation:  A  Review 

Suppose  that  our  goal  is  to  determine  the  kinematic  state  x  of  a  single  moving  target  on  the  basis  of  point 
observations  ,...,z„  collected  by  a  single  sensor.  In  the  discrete-time  Bayesian  framework  we  assume  the 
following  information: 

•  Measurement  space  =  precise  description  of  all  measurements  z  that  can  be  collected  by  the  sensor 

•  State  space  =  precise  description  of  all  parameters  x  necessary  to  uniquely  describe  all  target  states 

•  Likelihood  function:  f(z  |  x)  =  likelihood  of  seeing  observation  z  given  that  the  target  has  state  x 

•  Markov  motion  model:  f„+j  |  |  xj  =  likelihood  of  target  being  in  state  x^+j  at  time-step  a+i, 

given  that  it  was  in  state  x^  at  time-step  a 

•  Initial  state  model:  fo(x^  =  likelihood  that  target  was  initially  in  state  Xq 

•  Posterior  density:  /„  j  |  Z*)  =  likelihood  that  the  target  has  state  x„  at  time-step  a,  given  that 

observations  Z„  =  {z^  ,...,zj  have  been  collected 

2.1  Recursive  Bayesian  Nonlinear  Filtering  Equations.  Given  this  and  certain  additional  independence 
assumptions,  the  discrete-time  Bayesian  nonlinear  filtering  equations  (7a)  and  (7b)  show  how  to  recursively 
propagate  the  posterior  density  as  each  measurement  is  collected  [1,2,7,9,41]. 

2.2  State  Estimators.  The  posterior  density  /« |  |  Z“)  contedns  all  of  the  information  that  we  need  to  estimate 

the  state  of  the  target  at  time-step  a,  but  this  information  is  unavailable  for  practical  application  unless  we  also  have 


a  ”mathematical  can  opener^—a  state  estimator— ihaX  enables  us  to  extract  it  from  the  posterior.  A  (static)  estimator 
of  the  state  x  is  any  family  xfzj  of  state-valued  functions  of  the  (static)  measurements  z,  ,...,Zm  . 

"Good"  Bayes  state  estimators  x  should  be  Bayes-optimal  in  the  sense  that,  in  comparison  to  all  other  possible 
estimators,  they  minimize  the  Bayes  risk  [43,  pp.  54-63] 

Rc(x,m)  =  S  '  '  ’  i  i  . . ".Zm  1  x)f(\)  d\  dz,  •  •  •  dx^ 

for  some  Specified  cost  (i.e.,  objective  or  loss)  function  C(x,y)  defined  on  states  x,y.  Secondly,  they  should  be 
statistically  consistent  in  the  sense  that  x(z,  ,...,zj  converges  to  the  actual  target  state  x  as  the  number  of 
measurements  m  tends  to  infinity.  Other  properties  (e.g.,  asymptotically  unbiased,  rapidly  convergent,  stably 
convergent,  etc.)  are  desirable  as  well.  In  Bayesian  theory,  the  two  most  common  "good"  estimators  are  the 
maximum  a  posteriori  (MAP)  and  expected  a  posteriori  (EAP)  estimators: 

Xuaf(^i  ,...,zJ  =  argsupj(x  |  z,  .■■■„zj,  . . ,zj  =  J  xf(x  \  z,  ,...„zj  dx 

(where  ”argsup”  means  find  the  value(s)  x  of  x  which  maximize  f(x  |  ,..,zj).  Though  it  plays  an  important 

role  in  theory,  the  EAP  estimator  often  results  in  unstable  solutions  in  applications  in  which  the  posterior  has 
multiple  significant  modes  (e.g.  low-SNR  scenarios  in  which  insufficient  data  has  been  collected  to  ensure  the 
existence  of  a  single  dominant  mode). 

2.3  Provable  Convergence  of  Estimators.  An  important  point  that  is  often  overlooked  is  that  the  "goodness"  of 
an  estimator  crucially  depends  on  certain  seemingly  esoteric  mathematical  concerns.  For  example,  Wald’s  proof 
[44]  that  the  MAP  estimator  is  consistent  requires  the  following  assumptions: 

•  The  space  of  all  measurements  z  is  a  topological  space  satisfying  certain  properties; 

•  the  space  of  all  states  x  is  a  metric  space  satisfying  certain  properties;  and 

•  f(z  I  x)  is  measurable  in  the  variable  z  (with  respect  to  the  measurement-space  topology)  and  continuous 
in  the  variable  x  (with  respect  to  the  state-space  metric). 

3<  What  is  a  Multitarget  State? 

In  the  unknown-n  problem  we  encounter  basic  conceptual  difficulties  even  before  getting  started.  Multitarget 
posterior  densities  f(Xj , . . .  |  Z)  cannot  even  be  defined  unless  one  has  at  hand  a  multitarget  measurement  model 
f(Zj  I  that  tells  us  the  likelihood  of  seeing  measurements  Zj  given  the  presence  of  targets 

with  states  Xj  ,  However,  we  cannot  even  define  a  likelihood  function  unless  we  first: 

•  precisely  define  the  state  and  measurement  spaces, 

•  define  topologies  on  the  state  and  measurement  spaces, 

•  define  random  variables  on  the  state  and  measurement  spaces  using  these  topologies,  and 

•  define  the  multitarget  likelihood  function  as  a  conditional  distribution  of  these  random  variables 

But  how  do  we  uniquely  specify  all  of  the  states  that  the  multitarget  system  can  occupy?  Multitarget  states  must 
look  something  like  this: 


0  :  no-target  state 
Xi  :  the  single-target  states 
Xi  ,X2  :  the  two-target  states 
Xi  ,,..,Xn  :  the  n-target  states 


However,  this  naive  specification  of  multitarget  states  is  incomplete  The  symbol  x,x  signifies  not  that  a  single 


target  with  state  x  is  present  twice,  but  rather  that  two  completely  different  targets  happen  to  occupy  the  same 
kinematic  state  x.  Strictly  speaking,  therefore,  the  state  of  an  individual  target  is  not  fully  specified  (in  a 
multitarget  context)  unless  a  unique  identifier  (an  "I.D.  tag")  has  been  attached  to  it,  e.g.:  where  x  is  the 

target’s  kinematic  state  and  r  is  its  unique  identifying  tag.  Thus  the  incompletely  specified  two-target  state  x,x 
should  be  replaced  by  the  completely-specified  two-target  state  Two-target  states  of  the  form  (x,t), 

(y.r)  with  x  y  must  be  excluded  as  non-physical  since  the  same  target  cannot  occupy  two  different  kinematic 
states  simultaneously.  Also,  note  that  the  two-target  state  (Xi  ,t,),  (X2  ,T2)  and  the  two-target  state  (Xj  ,T2),  (xj 
,r,)  are  not  distinct:  They  represent  the  same  two-target  state.  What  results  is  a  rather  complicated  state  space 
which  is  partially  discrete  and  partially  continuous. 

Finite-set  statistics  (FISST)  resolves  these  technical  problems  by  representing  multitarget  states  geometrically— that 
is,  as  finite  sets  whose  elements  are  the  states  of  the  individual  targets,  endowing  multitarget  states  and  multitarget 
observations  with  suitable  topologies/metrics,  and  then  showing  that  the  resulting  multitarget  state  space  is 
sufficiently  well-behaved  to  ensure  good  filtering  behavior  [6,  pp.  194-198]. 

4.  Heuristic  versus  True  Multitarget  Likelihood  Functions 

When  the  number  of  targets  is  known,  it  would  appear  that  the  problems  summarized  in  Section  3  can  be 
sidestepped  because,  in  this  case,  multitarget  states  can  be  modeled  as  ordinary  vectors.  This  belief  is  mistaken. 
Given  that  we  have  multitarget  likelihood  function  f(Z]  ,...,z*  |  X;  ,...,xj,  how  can  we  systematically  construct  it 
from  a  knowledge  of  the  individual  sensors-rather  than  assuming  that  it  is  either  a  heuristic  contrivance  or  a 
mathematical  abstraction  that  comes  out  of  nowhere,  deus  ex  machinal  These  key  issues  underlying  known-/i 
multitarget  filtering  are  best  illustrated  by  examinining  the  event-averaged  maximum  likelihood  (EAMLE)  filter 
described  by  Kastella  in  1993.  [14] 

4.1  The  EAMLE  Filter.  Suppose  that  a  known  number  n  of  unknown  targets  is  observed  in  Poisson-distributed 
clutter  by  a  Gaussian  tracking  radar.  If  x,  ,...,x„  are  the  states  of  the  targets,  concatenate  them  into  a  single 

"system-level"  state  vector  X  =  (xi . xj.  Likewise,  if  Zj  are  the  observations  collected  by  the  sensor, 

then  concatenate  them  into  a  single  "system-level"  observation  vector  Z  =  (z,  ,...,zj.  Kastella  contructs  a 
heuristic  multitarget  "average  likelihood"  function 

fEAMLE(^  I  X,/i)  =  I  \,n,a)  p(n,a) 

where  the  summation  is  over  all  possible  associations  a  between  states  x,  ,...,x„  and  measurements  z,  . z„ 

and  where  f(Z  |  X,n,a)  is  the  likelihood  given  a  particular  association  a.  Assume  that  the  between-measurements 
evolution  of  the  "system"  state  vector  X  can  be  modeled  by  a  conventional  Kalman  equation  X^+i  =  A^X^  -1- 
.  If  foPi  I  n)  denotes  prior  knowledge  of  the  system  state  vector  X,  then-assuming  that  fEMieC^  I  X,«)  is  a  true 
multitarget  likelihood  function— argsup^fCZ  |  X,n)f(,(K.  \  n)  provides  an  optimal  estimate  of  the  system-level  state 
X.  Using  these  measurement  and  motion  models,  the  multitarget  system  vector  X  can  be  tracked  by  an  EKF. 

This  discussion  implies  that  known-«  filtering  can  be  accomplished  entirely  within  a  "naive"  vector-based  paradigm 
based  on  standard  filtering  theory.  In  reality,  such  a  perspective  overlooks  a  key  point: 

•  Is  /eamleC^  I  X,rt)  a  true  multitarget  likelihood  function  or  is  it  an  ad  hoc,  purely  heuristic  contrivance? 

4.2  True  Likelihood  Functions:  Single-Target  Case.  For  purposes  of  illustration,  consider  a  simpler  problem: 
single-target  filtering.  Here,  the  likelihood  function  f(z  \  \)  does  not  appear  out  of  nowhere.  We  are  able  to 
systematically  construct  it  from  an  explicit  sensor  model  z  =  h(x,vi)  that  describes  the  way  a  target  with  state 
X  generates  sensor  measurements  z  under  the  influence  of  random  noise  w  described  by  a  probability  density 
/w(y).  For  example,  let  C  be  a  matrix  and  suppose  that  the  sensor  model  is  z  =  Cx  -1-  w.  Then 


Is/zry|x;dy  =  Pr(z€S)  =  Pr(Cx+yy  &  S)  =  Pr(yv  E  S-Cx)  =  is.c.fJy)dy  =  isf.(y-Cx)dy 


where  S-Cx  denotes  the  set  of  all  s  -  Cx  with  s  G  5.  Since  this  is  true  for  all  measurable  S,  the  likelihood 
function  is  fjy  \  x)  =  f^(y-Cx)  almost  everywhere  in  y.  In  other  words:  We  know  that  f^(y  \  x)  is  the  true 
likelihood  function  for  the  problem  (and  not  just  a  heuristic  contrivance)  because  it  can  be  systematically  and 
rigorously  constructed  from  the  actual  sensor  model  using  standard  probabilistic  reasoning.  (In  purely  mathematical 
language:  The  likelihood  function  fjy  |  x)  can  be  explicitly  constructed  as  the  Radon-Niko<fym  derivative  /,  = 
dpjdk,  with  respect  to  Lebesgue  measure  X,  of  probability  measure  p^(S)  =  Pr(z  G  S).)  In  like  fashion  we  ask: 

•  How  do  we  construct  multitarget  sensor  models  of  the  general  form  Z  =  hQC,W)  where  Z  is  a 
multitarget  measurement,  X  is  a  multitarget  state,  and  W  is  a  random  clutter  process? 

•  How  do  we  rigorously,  systematically  construct  multitarget  likelihood  functions  f(Z  |  X)  from  the  model 
Z  =  h(X,W)  using  standard  probabilistic  procedures? 

4.3  FISST  Multitarget  Likelihood  Functions.  Addressing  these  questions  is  one  of  the  advances  made  possible 
by  FISST.  The  basic  idea  is  this:  We  construct  an  explicit  multitarget  measurement  model  of  the  form 

Z  =  r  U  W 

where  the  randomly  varying  finite  set  T  =T(X)  models  the  observations  generated  by  the  actual  targets  and  where 
the  randomly  varying  ^te  set  W  models  the  false  alarm  and/or  clutter  observations.  Assume  that  spurious 
observations  are  independent  of  actual  observations.  Then 

lsfz(Y\X)dY  =  PrCZ^S)  =  PrfTVW  Q  S)  =  S  s^e  ^  y/tO^-E  \  X)  ^E)  6Y 

where  the  integrals  are  multi-object  "set  integrals"  (see  [6,  pp.  141-144;  34,  p.  334]  and  equation  (15)  below). 
Since  this  is  true  for  all  measurable  S,  we  get 

f^(Y\X)  =  J:s^YfT(y-E\X)U(E) 

where  frCY  \  X)  can  be  explicitly  constructed  from  the  underlying  sensor  measurement  model  z  =  h(x,vi),  and 
where  f^(E)  can  be  explicitly  constructed  from  an  underlying  clutter  measurement  model  z  =  g(x,y).  Indeed,  as 
early  as  November  1993,  Mahler  showed  how  to  use  the  "set  derivative,"  [6,  pp.  150-152,20,22,27,34]  to 
systematically  and  rigorously  construct  multitarget  likelihood  functions  /zfZ  |  X),  fjfZ  |  X),  fy,(Z)  from  underlying 
sensor  models  even  when  the  number  of  targets  is  unknown.  Consequently,  just  as  the  equation  Pr(z  E  S)  =  j  ^ 
f(y  I  x)  dy  is  the  fundamental  relationship  that  allows  us  to  conceptually  link  single-target  sensor  measurement 
models  with  likelihood  functions,  so  the  FISST  equation  Pr(Z  Q  S)  =  J  ^  f(Y  \  X)  SY  is  the  fundamental 
relationship  that  allows  us  to  conceptually  link  multitarget  sensor  measurement  models  with  "global"  (i.e.,  true 
multitarget)  likelihood  functions.  In  particular:  In  April  1994  we  demonstrated  [6,  pp.  228-235;  20]  that,  when 
FISST  is  restricted  to  the  special  situation  addressed  by  EAMLE  then 

(8)  fFlSSr(^i  I  *1  . . I  *1  . . 

That  is:  Using  FISST,  one  can  prove  that  the  heuristic  multitarget  likelihood  |  X,n)  actually  is  the  true 

multitarget  likelihood  function  for  this  particular  multitarget  problem. 

5.  Statistically  Coupled  versus  Statistically  Decoupled  Multitarget  Probability  Distributions 

We  now  turn  exclusively  to  applications  in  which  the  number  n  of  targets  is  not  known.  Some  of  the  key  issues 
involved  in  the  unknown-n  problem  are  best  illustrated  by  describing  "generalized  EAMLE",  a  multitarget  filtering 


scheme  devised  at  Lockheed  Martin  by  Kastella  in  May  1993  [17]  that  fails  to  exhibit  any  grasp  of  these  issues. 


5.1  "Generalized  EAMLE"  Filter.  Generalized  EAMLE  consists  of  the  following  ad  hoc  sequence  of  Bayesian 
and  non-Bayesian  steps: 

Step  1:  Let  fo(X.  \  n)  be  the  prior  over  all  system  state  vectors  X  with  n  targets,  i.e.  /  /ofX  \n)  dK  =  1 
Step  2:  Let  f(L  (  X,n)  be  the  EAMLE  average-likelihood  function  (X  is  a  system  state  vector  with  n  targets) 
Step  3:  Construct  a  mean-likelihood  function  f(L  \n)  =  \  f(L  \  X,n)foPL  \  n)  dX  for  each  n  =  1.2,... 

Step  4:  Determine  the  most  likely  value  of  n  via  an  ML  estimator:  i.e.  h  =  argmax„f(L  \  n) 

Step  5:  Given  h,  use  a  MAP  estimator  to  find  the  value  of  the  multitarget  state  X,  i.e., 

XU.H  =  argsupfCZ.\X,A)foCX\fi) 


(Since  the  only  description  of  this  approach  is  an  unpublished  technical  document,  we  reproduce  the  relevant  text 
as  an  Appendix.)  Generalized  EAMLE  is  "horizontal"  in  structure:  The  unknown-/i  multitarget  estimation  problem 
is  dissected  into  parallel,  statistically  decoupled  known-n  problems  for  «  =  1,2,...,  with  each  known-«  problem 
being  processed  using  one  cycle  of  an  EAMLE  filter  (see  Figure  1).  Multitarget  state  space  is  partitioned  into 
separate  state  spaces,  one  for  each  target  number  n  =  1,2,...  (the  zero-target  case  is  overlooked  entirely).  The 
likelihood  functions  f(Z  |  X,n)  for  n  =  1,2,...  presume  that  the  number  of  targets  is  fixed.  Prior  knowledge  is 
specified  by  separate  prior  probability  densities  fgCX  \n)  for  n  =  1,2,...  Target  number  is  determined  not  by 
directly  comparing  all  multitarget  states  X  =  ,...,>0  with  all  multitarget  states  Y  =  (y,  ,...,y„.)  for  arbitrary 

target  number  n,  n'  but,  rather,  by  computing  mean-likelihoods  /(Z  |  n)  separately  for  each  known-«  problem 
and  then  in  effect  comparing  an  "typical  n-target  state"  to  a  "typi(^  n '-target  state".  This  statistical  decoupling 
of  the  xmknown-«  problem  has  the  following  consequences: 

•  Because  the  priors  fgPi  |  n)  are  decoupled,  they  contain  no  prior  knowledge  fo(n)  regarding  the  number 
of  targets 

•  Because  the  parallel  known-n  filters  are  decoupled,  probability  mass  cannot  shift  from  one  row  of  Figure 
1  to  another. 

•  Because  state  estimation  is  decoupled,  multitarget  states  with  differing  target  number  cannot  be  directly 
compared  with  each  other  to  more  effectively  determine  which  are  most  or  least  likely. 

5.2  Key  Points  of  Multitarget  Recursive  Bayesian  Filtering.  The  source  of  these  difficulties  is  the  fact  that 
generalized  EAMLE  exhibits  no  conceptual  grasp  of  the  following  key  points,  especially  item  (11): 

(9)  Reconceptualize  a  group  of  unknown  targets,  of  unknown  number,  as  a  single  joint  multitarget  system 

(10)  Reconceptualize  target  number  and  individual  target  states  as  just  specific  parameters  of  a  single  joint 
multitarget  state 

(11)  Reconceptualize  priors,  posteriors,  and  likelihoods  as  single  densities  involving  an  unknown  number  of 
targets  and  target  states 

(12)  Reconceptualize  detection  and  estimation  as  different  aspects  of  a  single,  simultaneous  statistical  process 

(13)  Recognize  that  one  cannot  just  "declare  victory":  reconceptualization  is  the  easy  part,  because  it  leads  to 
unexpected  theoretical  and  practical  difficulties  that  caimot  be  swept  under  the  rug 

5.3  FISST  Statistically  Coupled  Filtering.  These  key  points  are  the  conceptual  foundation  of  true  (i.e.,  non- 
nalve)  multitarget  Bayesian  recursive  nonlinear  filtering~and  the  FISST  approach  to  it  in  particular.  This  tq>proach 
is  vertical  in  structure:  It  correctly  models  the  statistical  coupling  between  all  multitarget  states  and~in  particular- 
regards  multitarget  priors  and  posteriors  as  single  probability  distributions  (see  Figure  2).  For  example,  the  FISST 
"global  maximum  likelihood  estimator  (global  MLE)"  [20,22,27,34]  was  introduced  in  Nov.  1994.  The  global 


n=l  /o(X|l)  _  /(Z|X,l)/,(Xll)  ^  /(ZlD 

/i  =  2:  /o(X|2)  ^  /(Z|X,2)/o(X|2)  /(Z|2) 

n  =  ^:  /.(XU)  ^  f{Z\X,k)f^{X\k)  f(Z\k) 


Figure  1:  "Generalized  EAMLE"  The  cx>nceptual  key  points  underlying 
multitarget  Bayesian  nonlinear  filtering— unified  multitarget  state  spaces,  joint 
multitarget  probability  distributions,  and  simultaneous  estimation  of  multitarget 
states— are  illustrated  via  comparison  with  "generalized  EAMLE,"  an  ad  hoc 
approach  that  fails  to  grasp  these  points.  Generalized  EAMLE  has  a 
"horizontal"  structure  (above):  The  multitarget  filtering  problem  is  broken  up 
into  parallel,  completely  decoupled  filters.  Each  filter  deals  with  a  different 
hypothesis  about  the  number  n  =  7,...,^  of  targets  (the  zero-target  hypothesis 
is  neglected).  Because  the  priors  |  n)  are  decoupled,  they  provide  no 
means  of  specifying  initial  belief  /^(n)  regarding  the  number  of  targets. 
Because  the  parallel  filters  are  decoupled,  probability  mass  cannot  shift  from  one 
row  to  another.  Because  state  estimation  is  decoupled  (i.e.,  target  number  is 
determined  by  computing  a  marginal  likelihood  f(Z  \  n)  for  each  row), 
multitarget  states  with  differing  target  number  cannot  be  directly  compared  with 
each  other  to  determine  which  are  most  or  least  likely. 
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Figure  2:  Multitarget  Bayesian  nonlinear  filtering,  by  way  of  contrast,  has 
a  "vertical"  structure  (above)  that  correctly  models  the  coupling  between  (i.e., 
relative  probability  contributions  of)  all  multitarget  states.  It  does  so  via 
multitarget  densities  that  are  "global"  ("joint")  in  that  they  are  single  densities 
involving  an  unknown  number  of  targets  and  target  locations,  defined  over  a 
unified  multitarget  state  space.  The  multitarget  prior  /^fX)  is  a  "global" 

collection  {^(X  |  i)fo(i)}i^o.i . *  of  conventional  densities  coupled  together  by 

a  joint  normality  condition  j  foQQ  bX  -  1  (where  ’  J  ’  denotes  a  multitarget 
or  "set"  integral).  The  multitarget  posterior  f^QC  |  Z)  is  a  "joint  multitarget 
probability"  \fo(X.  \  Z,i)  f(i  \  Z)}i,^o,i....,k  of  the  same  type.  The  joint 
multitarget  likelihood  function /(Z  |  X)  is  a  joint  collection  {f(Z  \ 


MLE  integrated  the  functions  of  detection  and  localization  into  a  single,  simultaneous,  and  provably  optimal 
statistical  operation.  Suppose  the  sensor  collects  observation-sets  :  Zy  ,...,Z^.  Then  the  global  likelihood 
function  is  L(K  |  =  /^(Zy  \X)  •  •  •  f^(Z^  \  JQ-i.e.,  a  single  density  involving  an  unknown  number  of  targets 

and  target  states.  The  global  MLE  is  [22]: 

argsup  =  argsup 


where  "argsupx"  denotes  that  unique  value  of  X  which  (if  it  exists)  maximizes  LPC  |  It  is  provably  optimal 
because  f(Z  \  X)  is  a  true  multitarget  likelihood  function  (since  it  can  be  computed  directly  from  sensor 
characteristics  using  the  ”set  derivative"  [6,22,34]).  In  early  1994,  three  Lockheed  Martin  internal  technical  reports 
described  the  global  MLE  process  using  simple  numerical  examples.  [6,  pp.  256-259;  27]  In  April  1994,  EAMLE 
was  proved  to  be  a  special  case  of  the  global  MLE.  [20]  Also  in  November  1993,  Mahler  introduced  the  concept 
of  a  "global"  probability  density,  i.e.  one  which  takes  account  of  the  fact  that  both  numbers  of  targets  and  numbers 
of  observations  are  generally  random.  Using  both  set  notation  and  more  conventional  notation,  it  was  shown  that 
such  densities  could  be  equivalently  written  in  the  form 

(14)  /({x,  .x„})  =  n//(Xi  ,...,xj 


and  that,  in  either  notation,  they  were  defined  by  the  joint  normality  condition  [20  p.  9] 

k  k  . 

(15)  Y.  =  1  =  H  -77 


11  =  0 


n»0 


that  was  subsequently  expressed  in  abbreviated  form  as  a  so-called  "set  integral"  J  f(X  |  Z^"^)  dX  =  1 
6.  Optimal  versus  Heuristic  Multitarget  Estimators 

Thus  far  we  have  underscored  difficulties  associated  with  the  filtering  aspects  of  equations  (7a',  7b').  Even  if  we 
have  managed  to  construct  the  multitarget  posterior,  however,  our  work  is  still  not  done.  We  must  also  find  a 
means  of  extracting  from  it  the  information  that  we  want:  estimates  of  the  multitarget  state  variables  (target 
number,  identities,  positions,  velocities,  etc.).  Here,  we  encounter  even  greater  difficulties.  Some  of  the  key  points 
involved  in  unknown-n  multitarget  state  estimation  are  best  illustrated  by  contrasting  them  to  "joint  multitarget 
probabilities  (JMP)",  a  renaming  of  FISST  core  concepts  that  fails  to  address  these  key  points. 

6.1  "Joint  Multitarget  Probabilities  (JMP)".  Describing  it  as  "similar  in  some  respects  to  the  random  set 
formalism  of  Mahler"  [13  p.  168],  Kastella  introduced  "JMP"  [13,16]  at  Lockheed  Martin  in  July  1996  as  a  solution 
to  the  unknown-/!  problem.  However,  only  a  trivial  change  of  notation  differentiates  "JMP"  from  a  number  of  core 
FISST  concepts  devised  years  earlier.  [24  Sect’s  2-3,29,30,40  pp.  27-28]  The  relevant  chronology  is  listed  in  Table 
III  below.  From  late  1993  on,  research  on  FISST  was  widely  reported  in  the  form  of  simple  numerical  examples 
and  other  internal  Lockheed  Martin  research  reports;  internal  and  external  presentations;  and  published  conference 
papers.  When  it  appeared  in  July  1996,  "JMP"  (as  a  solution  to  both  unknown-/!  multitarget  filtering  and  multitarget 
information  theory)  reiterated  FISST  "key  points"  developed  during  this  period  (indeed,  even  some  of  the  same 
notation— see  equ.  (14)  above).  These  include: 

•  the  concept  of  "a  single  density  involving  an  unknown  number  of  targets  and  target  locations" 

•  multitarget  posteriors 

•  set  integrals 

•  multitarget  Kullback-Leibler  measures 


•  the  "almost-parallel  worlds  principle  (APWOP),”  a  systematic  approach  to  generalizing  single-sensor, 
single-target  concepts  and  approaches  to  multisensor-multitarget  problems 

In  particular,  Kastella  used  the  APWOP  to  directly  generalize  his  single-target  "discrimination  gain"  approach  to 
the  multitarget  case.  [13,  p.  168,24,40  pp.  27-28] 


TABLE  m:  A  FISST  Chronology 


Date 

Finite-Set  Statistics 

"Joint  Multitarget  Probabilities" 

05/93 

Generalized  EAMLE  [17]  < - 1 

11/93 

Global  MLE  [20,22] 

03/94 

Numerical  examples  [27] 

! 

04/94 

EAMLE  =  special  case  of  global  MLE  [20] 

04/94 

Joint  multi-object  densities  [20] 

1  2‘A 

yrs 

10/94 

Multitarget  posteriors,  multitarget 
Kullback-Leibler  metrics,  APWOP  [34] 

1 

11/95 

Unknown-n  unsolved  [15]  <■ - — * 

03/96 

07/96 

Multitarget  nonlinear  filtering  [23] 

"JMP"  [13] 

Given  this  chronology  and  our  discussion  in  Section  4,  any  claim  that  "JMP"  is  just  an  obvious  embellishment  of 
"generalized  EAMLE"  (and  therefore  that  "JMP"  actually  pre-dates  FISST)  would  simply  not  be  credible.  On  the 
one  hand,  in  Section  4  we  demonstrated  that  generalized  EAMLE  is  an  hoc  procedure  that  exhibits  no 
understanding  of  any  of  the  conceptual  "key  points"  (9)-(13)  of  multitarget  Bayesian  recursive  nonlinear  filtering 
(and  of  "JMP"  in  particular).  On  the  other  hand,  as  late  as  November  1995,  Kastella  reported  being  still  in  search 
of  a  satisfactory  solution  of  the  unknown-/i  problem—despite  the  prior  existence  of  "generalized  EAMLE" .  (To  wit: 
"Some  basic  questions  for  future  research  are... How  to  extend  the  results  to  the  case  of  an  unknown  number  of 
targets..."  [15  p.  43]).  It  would  be  no  more  credible  to  claim  that  "JMP"  is  "simpler"  than  FISST.  A  trivial 
change  of  notation  constitutes  neither  a  simplification  nor  a  technical  advance.  Neither  does  inattentiveness  to 
unexpected  research  difficulties-for  example,  recognition  that  optimal  estimation  in  multitarget  Bayesian  filtering 
caimot  be  taken  for  granted;  or  recognition  of  the  closely  related  fact  that  continuous  variables  create  fundamental 
practical  difficulties  in  multitarget  Bayesian  filtering.  [6  pp.  190-194,23,29] 

Specifically,  in  "JMP"  state  space  is  completely  discretized,  with  the  following  assertion  that  the  continuous-state 
case  follows  immediately  from  a  trivial  change  of  notation: 

•  "For  simplicity,  the  target  space  is  discretized  into  a  collection  of  cells  (in  the  continuous  case,  the  cell 
probabilities  can  be  replaced  by  densities  in  the  usual  way)."  [16  pp.  123] 

(In  actuality,  passage  from  the  discrete-variable  to  the  continuous-variable  case  introduces  fundamental 
complications.  [6  pp.  190-194;18])  Given  this,  a  JMP  has  the  form  [13  p.  167,16  p.  122] 

(16)  pic^,...,c^\Z)  or  . 


where  Z  is  a  set  of  observations,  n  is  the  (unknown)  number  of  targets,  Xi  are  discrete  location  cells,  and 

Cl  are  the  respective  I.D.  classes  of  the  targets  in  these  cells.  This  is  essentially  the  alternative  notation  for 

a  global  posterior  density  described  in  equations  (8)  and  (14).  In  JMP,  the  family  of  probability  distributions  p" 
indexed  by  n  are  treated  as  a  single  joint  probabilistic  entity  by  requiring  [13  p.  168,16  p.  123]  that  they  satisfy 
the  following  normalization  criterion: 
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(17) 


1 


E  E  Pc^^..cS^\ . = 

»=0  . «» 


This  is  the  earlier  FISST  formula  (15)  assuming  that  all  state  variables  are  discrete.  That  is,  a  IMP  is  a  FISST 
global  posterior  density  written  in  terms  of  the  following  alternative  notation  (see  equ.’s  (8,14)): 


(18) 


P(ci . cJZ)  =  —J{{c^,...,c„]\Z),  p"  (X, . xJZ)  =  ^/({(A:,,c,),...,(a:„,c„)}iz): 

n\  I  ■ 


"JMP"  employs  the  multitarget  filtering  equations  (7a\  75')  with  a  simplified  multitarget  motion  model  (50 
(19)  /«.i|.(yiv...yjxi,...,x,)  =  4.i|.(yilxi)  '  /«*,|.(yjxj 


This  motion  model  assumes  that  the  motions  of  the  targets  in  a  scenario  are  independent  and  that  targets  do  not  enter 
or  leave  the  scene.  (In  the  second  prequel  to  this  paper  [30]  we  pointed  out  that  dynamic  multitarget  scenarios,  in 
which  targets  can  leave  or  enter  the  scene,  require  general  multitarget  motion  models  that  more  completely  capture 
the  actual  complexity  of  real  tracking  problems.)  Target  class  and  kinematics  variables  are  assumed  to  be 
independent.  The  multitarget  state  is  computed  using  the  following  multitarget  estimator: 

(20)  =  argsup  p*(x^,...,x^\Z)  where  n  =  argmax  ( p”ix^,...,x^\Z)dx^,...,dx^ 

6.2  Key  Points  of  Multitarget  State  Estimation.  Once  again,  this  discussion  implies  that  unknown-n  filtering 
can  be  accomplished  entirely  within  a  purely  heuristic  generalization  of  standard  filtering  theory.  In  reality,  a  naive 
approach  overlooks  several  key  points: 

•  Is  the  proposed  estimator  equ.  (20)  optimal  or  is  it  an  hoc,  heuristic  contrivance? 

•  In  particular:  Is  it  Bayes-optimal  and  is  it  guaranteed  to  converge  to  the  correct  solution? 

•  Computation  of  the  marginal  (Step  1)  throws  away  information  contained  in  the  multitarget  posterior.  What 
are  the  consequences  of  this  for  rapidity  of  convergence? 

•  What  are  the  stability  properties  of  this  estimator  under  high-noise  or  high-clutter  conditions? 

In  mid-1995  Mahler  proved  that  the  multitarget  estimator  equ.  (20)  is  Bayes-optimal  (the  proof  is  reported  in 
Mathematics  of  Data  Fusion  [6  pp.  190-205].)  However,  it  is  unclear  whether  or  not  this  estimator  is  convergent. 
In  Section  7  below,  we  demonstrate  that  it  is  likely  to  be  slowly  convergent  and  to  produce  unstable  solutions  under 
low-observable  conditions.  At  that  time,  he  also  proved  that  another  estimator  is  both  provably  Bayes-optimal  and 
provably  convergent  (see  Section  7). 

To  more  fully  xmderstand  these  key  points,  let  us  take  the  position  that  a  naive,  heuristic  approach  is  sufficient  and, 
consequently,  that  we  can  proceed  immediately  to  a  declaration  of  victory.  We  assume  that  we  can  define  a 
multitarget  posterior  distribution  function  on  a  multitarget  state  space  that  need  not  be  carefully  defined.  To  keep 
things  simple,  assume  that  all  targets  are  motionless,  exist  in  one  dimension,  and  are  completely  specified  by  their 
locations  on  the  real  line  as  measured  in  meters.  Also  assume  that  a  single  sensor  collects  a  set  Z  of  measurements 
from  the  targets,  whose  number  as  well  as  positions  are  unknown  and  are  to  be  estimated.  One  can  write  a  naive 
posterior  distribution  function  f  (state  \  Z)  on  the  multitarget  state  space,  given  measurements  Z,  as  follows: 


f(0  Z)  =  posterior  likelihood  of  zero  targets 
f(Xj  I  Z)  =  posterior  likelihood  of  one  target  in  state  Xj 
f(Xj  ,X2  I  Z)  =  posterior  likelihood  of  two  targets  in  states  Xj  ,X2 
f(Xj  I  ZJ  =  posterior  likelihood  of  n  targets  in  states  Xj 

Since  the  cumulative  likelihood  summed  over  all  multitarget  states  must  be  7,  it  follows  that 

f(0  I  Z)  +f(l  I  Z)  +  ...  +f(n  I  Z)  +  ...  +f(M\Z)  =  1 

where  f(0  |  Z)  =  /('0  Z)  and  f(n  f  Z)  =  j  /(Xj  \Z)  dxj  •  •  •  dx„  for  n  =  and  where  M  is 

the  maximum  expected  number  of  targets  in  the  scenario.  Now,  let  us  use  a  naive  MAP  procedure  to  estimate  the 
complete  state  of  the  multitarget  system.  Then  we  would  write: 

{ Xi  . x^ }  argsup  /(x, ,...,  x„  |  Z) 


where  h  is  the  estimated  number  of  targets  and  the  ic,  are  the  estimated  positions  of  the  h  targets 
Unfortunately,  there  is  a  problem:  From  the  definition  of  a  Riemann  integral  we  know  that: 

•  f(0  I  Z)  =  unitless  probability 

•  f(l  I  Z)  =  unitless  probability,  so  units  of  f(x,  |  Z)  must  be  1/meter  since  the  units  of  dx/  are  meters 

•  f(2  I  Z)  =  unitless  probability,  so  units  of  f(x,  .Xj  |  Z)  must  be  1/meter^ 

•  f(n  \Z)  =  unitless  probability,  so  units  of /fX;  ,...,x„  I  Z)  must  be  l/meter" 

Consequently  the  "argsup”  operation  is  not  defined  since  the  quantities  f(0  \  Z),  f(x,  \  Z) ,...,  f(Xj  ,...,x„  \  Z) ,  .. 
are  incommensurable  with  respect  to  units  of  measurement.  Thus  the  MAP  estimator  cannot  even  be  defined! 

One  might  try  to  sidestep  this  problem  by  using  Riemann-Stieltjes  integrals.  That  is,  let  g(x)  be  an  arbitrary 
density  function  on  (single-target)  state  space  and  let  G(x)  =  \  f  g(y)  ^  be  its  corresponding  cumulative 
probability  function.  Then  one  could  instead  define  different  multitarget  posterior  densities  h(x,  ,...,x^  \  Z)  using 
Riemann-Stieltjes  integrals 

A(n  I Z)  J  A(Xj ,...,  x„  I Z)  JG(Xj)  - dG(x^)  =  j  /i(x, x,  |  Z)  g(Xj)  -gfx^)  dx^  -dx, 

where,  now,  the  multitarget  distributions 

h(Xi,...,x„|Z)  = 


f(Xi . x^\Z) 

SCXj)  •"g(x„) 


are  unitless.  Then,  the  multitarget  distribution  h  will  not  have  the  incommensurability-of-units  problem  just  noted 
and  so  could  be  used  to  define  a  multitarget  MAP  estimate.  The  price,  however,  is  the  unacceptable  introduction 
of  an  arbitrary  "fudge  factor" -\hs  density  g-into  the  concept  of  a  multitarget  posterior  distribution  h. 

If  we  instead  turn  to  the  EAP  estimator  for  our  salvation,  our  troubles  get  even  worse.  A  multitarget  posterior 
expectation,  if  it  exists,  must  have  the  general  form  J  (x;  ,...,x„)/((x;  .....x,)  |  Z)  d{xi  ,...,x,)  where  the  (as  yet 
to  be  defined)  integral  is  taken  over  all  (thus  far  vaguely  defined)  multitarget  states  (x;  ,...,x„).  Such  an  integral 
caimot  even  be  defined  unless,  at  minimum,  the  multitarget  state  space  is  a  vector  space-in  particular,  unless  it  has 
a  concept  of  addition/subtraction.  But  how  does  one  add  the  zero-target  state  0  to  a  single-target  state  x?  Or 


a  single-target  state  x  to  a  two-target  state  Xj  We  could  attempt  to  address  this  problem  by  embedding  the 
multitarget  state  space  in  a  larger,  enveloping  space  which  is  a  vector  space  (as  indeed  can  be  done  in  many  ways). 
In  this  case,  however,  there  is  no  guarantee  that  the  posterior  expectation  would  yield  values  which  are  actual 
multitarget  states.  Rather,  it  is  more  likely  that  it  would  yield  values  which  are  contained  in  the  enveloping  vector 
space  and  therefore  which  would  have  no  physical  meaning.  In  summary: 

•  Having  been  denied  the  accustomed  security  of  the  classical  estimators,  we  are  forced  to  propose  new  ones 
and  demonstrate  that  they  are  statistically  well-behaved. 

Even  if  a  multitarget  MAP  or  EAP  estimator  could  be  defined,  to  prove  convergence  of  the  multitarget  filter  we 
would  still  need  to  know  that  the  multitarget  likelihood  function  is  measurable  in  the  variables  Zi  ,...,4 
continuous  in  the  variables  Xj  But  measurable  with  respect  to  which  topology  on  the  multitarget 

measurement  space?  Continuous  with  respect  to  which  metric  on  the  multitarget  state  space?  The  latter  question 
is  far  from  trivial.  What  is  the  distance  between  a  single-target  state  x  and  a  two-target  state  Xj  ,xfl  Or  the  zero- 
target  state  0  and  the  single-target  state  xp.  What  is  the  distance  between  the  two-target  states  Xi  ,X2  and 
,Xj  if  Xj  9^  X2?  (The  Euclidean  metric  gives  us  ||  (Xj  ,X2)-(X2  ,Xj)  ||  =  ||  ,X2-Xj)  ||  ^  0.  But  the  state  ”an 

F-16  at  X;  flown  by  Joe  and  an  F-22  at  X2  flown  by  Ralph"  is  the  same  multitarget  state  as  "an  F-22  at  X2  flown 
by  Ralph  and  an  F-16  at  X;  flown  by  Joe".)  One  can  then  try  to  tinker  various  metrics  for  multitarget  state  space, 
only  to  get  pulled  into  a  a  morass  of  arbitrary,  ad  hoc  definitionizing. 

6.3  FISST  Optimal  Multitarget  Estimation.  Resolution  of  such  questions  is  one  of  the  advances  made  possible 
by  FISST.  The  unexpected  research  difficulties  associated  with  continuous-  and  discrete-variable  "global"  (i.e. ,  joint 
multitarget)  posterior  and  prior  distributions  were  quickly  recognized  following  their  introduction  in  Dec.  1994  [34]. 
These  difficulties  were  resolved,  by  July  1995,  via  the  definition  of  two  new  MAP-like  multitarget  estimators  and 
verification  of  their  respective  optimality  properties.  These  results  were  reported  in  Mathematics  of  Data  Fusion 
[6  pp.  194-205].  These  estimators  are: 

•  Marginal  Multitarget  (MaM)  State  Estimator  (Provably  Bayes-optimal;  improved  convergence) 

{x, }  argsup  /({x^ . x^  |  Z)  where  n  argmax  [fix, x_  |  Z)  dx,  - dx„ 

«  n!  •'  ^ 

•  Joint  Multitarget  (JoM)  State  Estimator  (Provably  Bayes-optimal  and  convergent):  for  some  constant  c, 

{Xj x^ }  argsup  /({Xj x,  |  Z)  •  c  " 


(Note:  In  Mathematics  of  Data  Fusion,  MaM  was  called  "GMAP-I"  and  JoM  was  called  "GMAP-II" 

7.  Stable  Convergence  of  Multitarget  Estimators 

The  purpose  of  this  section  is  to  describe  some  of  the  difficulties  associated  with  multitarget  estimation  and  to  assess 
the  relative  merits  of  the  JoM  and  MaM  estimators  described  in  Section  6.3.  We  will  begin  with  an  intuitive 
discussion  in  Section  7.1  that  illustrates  why  the  MaM  estimator  can  be  expected  to  have  less  desirable  performance 
than  the  JoM  estimator.  Then,  in  Section  7.2,  we  will  verify  this  assessment  by  analytically  comparing  the 
convergence  properties  of  these  two  estimators  in  a  simple  model  problem. 

7.1  Why  Estimation  Using  Marginal  Distributions  Leads  to  Trouble.  Under  low-observable  conditions  special 
care  must  be  exercised  in  the  selection  of  state  estimators  in  both  single-target  and  multitarget  problems.  Consider 
the  following  analogy.  The  following  joint  posterior 


f(xj)  -  NjJx)  Noj(y-2)  +  No,ooi(x-5)  N^.ofy) 


describes  the  current  position  of  a  target  on  the  x-y  plane,  where  NJx)  denotes  a  one-dimensional  normal 
distribution  with  variance  This  distribution  is  bimodal  with  a  large  dominant  mode  near  (5,0)  and  a  small 
minor  mode  near  (0,2).  The  true  jomr  estimate  of  (x,y)  is  the  joint  MAP  estimate 

^MAP.yMAp)  =  argsup^yf(x,y)  =  (5.0) 

The  EAP  estimate  is  ^eap  Seap)  =  (0,2)  +  (5,0)  =  (5,2).  This  estimate,  which  lies  on  a  low-probability  part 
of  the  tail  of  the  dominant  mode,  is  not  very  good.  The  reason  is  that  the  value  of  the  EAP  estimator  is  strongly 
influenced  by  the  small  minor  mode.  Suppose,  instead,  that  we  compute  the  marginal  posteriors 

f(y)  =  f(x.y)  dx  ^  Noj(y-2)  +  f(x)  =  J  f(x,y)  dy  =  N.Jx)  +  No,ooi(x-5) 

Notice  that  x,y  are  correlated  since  f(x,y)  ^  f(x)  f(y).  Three  possible  marginal  estimates  can  be  derived: 

(argsupj(x),  argsupyfty))  =  (5,  argsupyf(5,y))  =  (argsupj(x,2),  2)  =  (5,2) 

As  with  the  EAP  estimate,  these  estimates  are  strongly  influenced  by  the  small  minor  mode. 

Now,  under  low-observable  conditions  a  posterior  f(x,y  \  Z)  will  be  highly  multi-modal.  Even  if  it  has  a  single 
well-localized  dominant  mode,  there  will  be  a  number  of  small  minor  "clutter"  modes  that  will  tend  to  jump 
erratically  from  time-step  to  time-step.  If  we  use  the  EAP  estimator  or  any  of  the  above  marginal-based  estimators, 
the  previous  example  shows  that  their  values  will  all  tend  to  unstable  (i.e. ,  vary  erratically)  whereas  the  joint  MAP 
estimate  will  follow  the  dominant  mode  and  therefore  tend  to  be  coherent.  The  same  reasoning  applies  even  if  one 
of  the  state  variables  is  discrete— for  example,  if  the  posterior  f(x,o)  \  Z)  where  x  is  position  (continuous)  and 
o)  is  target  identity  (discrete).  If  x  and  o)  are  correlated  (f(x,o})  ^  f(x)f(o3))  then  deducing  kinematics  and  target 
I. D.  from  the  marginals  f(o))  =  J  f(x,o))  dx,  f(x)  =  will  lead  to  the  same  kinds  of  instability. 

In  fact,  the  same  reasoning  applies  to  determination  of  target  number  and  target  kinematics/I.D.  in  the  multitarget 
case.  If  target  number  is  correlated  with  target  states,  basing  estimates  on  the  marginal  posterior  f(0  |  Z)  = 
/(0  I  Z),  f(n  I  Z)  =  J  f(Xi  ,...,x^  \  Z)  dxj  •  •  •  dx^  will  lead  to  instability  under  low-observable  conditions.  The 
following  example  hi^ights  this  behavior.  Suppose  that  we  are  given  the  multitarget  posterior: 

/f0|Z)  =  1-q,  f(x\Z)  =  qNJx) 

where  a  is  very  small-i.e.,  the  normal  distribution  has  a  very  narrow,  high  peak  that  strongly  indicates  the 
presence  of  a  target.  The  MaM  estimator  will  decide  in  favor  of  the  presence  of  a  target  if 

1-q  =  f(0  \Z)  <  f(l\Z)  ^  q 

i.e.,  if  q  >  16.  That  is,  the  MaM  estimator  completely  ignores  critical  information  contained  in  the  posterior- 
namely  the  variance  a^.  The  JoM  estimator,  on  the  other  hand,  will  decide  in  favor  of  the  presence  of  a  target  if 

<  c  argsup  f(x\Z)  =  — 

^  o 

To  evaluate  this  inequality  we  need  to  select  a  value  for  c.  Assume  that  the  prior  multitarget  density  is  uniform 
in  the  sense  that  fo(0  \Z)  =  A,  \Z)  —  A  AK  If  the  prior  is  accepted  as  a  standard,  the  balancing-point 
for  deciding  between  yes-target  and  no-target  is  A  —  fo(0  |  Z)  =  c  •argsup^fQ(x  \  Z)  =  AcA^  and  thus  c  =  A. 


Accordingly,  the  test  for  the  JoM  estimator  is 

^  (q-^  -  1) 

o 

In  other  words,  the  JoM  estimator  makes  a  more  nuanced  decision  by  balancing  the  two  pieces  of  information  that 
the  posterior  contains  (i.e.,  q  and  a).  Information  supporting  a  no-target  decision— e.g.  q  is  small— can  be 
countermanded  by  sufficiently  strong  information  supporting  a  yes-target  decision— i.e.,  a  is  sufficiently  small. 

7.2  Analytical  Comparison  of  the  MaM  and  JoM  Estimators.  The  purpose  of  this  section  is  to  derive  an 
analytical  comparision  between  the  convergence  behaviors  of  the  JoM  and  MaM  estimators  in  a  simple  model 
problem.  Assume  that  a  single  motionless  target  in  one  dimension  is  located  at  Xq  and  is  observed  by  a  single 
sensor  whose  noise  density  is  f(z  |  x)  =  f(z-x)  where  f(x)  is  a  mean-zero  density.  We  also  assume  that  the  sensor 
detects  the  target  with  probability  Po  —  h  and  that  the  target  is  obscured  by  independent,  uniformly  distributed, 
and  extremely  dense  Poisson  clutter.  That  is,  the  typical  observation  has  the  form  Z  —  T  \J  W  where  T  =  TQQ 
is  the  single  observation  generated  by  the  target  and  W  are  the  clutter  observations.  Assume  that  the  size  of  the 
region  of  interest  is  A,  Multitarget  states  have  the  form  X  =  {x}  and  multitarget  observations  have  the  form  Z 
=  {zj  or  Z  =  0.  The  FISST  multitarget  calculus  shows  that  the  multitarget  likelihoods  for  T,W  are 

fr(0  \x)  =  0,  friz  I  x)  =  f(z-x),  fjz,  .zJ  =  X"  A-^ 

where  X  is  the  Poisson  parameter.  The  FISST  calculus  also  shows  that  the  true  multitarget  likelihood  f(Z  |  X) 
for  all  observations  (target  and  clutter)  is  given  by  f(Zi  |  0)  =  X"*  f(0  \  0)  =  and 

f(.Zi,....zJx)  =  /(Zi.....2„l 

Define  g(0  \x)  =  0,  g(0  j  0)  =  i,  and 
g(Zi.-.z„lx)  = 


/(»  I  x) 


AZf-x)  , 


0)  -  1 


Let  Z,  be  a  sequence  of  observation-sets  and  =  {Z,  .....ZJ.  Also,  let  fo(0)  =  1-Q,  fo(x)  =  QA  ' 

be  the  multitarget  prior  distribution.  Since  the  posterior  distribution  is 


fjZjX)  -/(Z,|X)/o(X)  _  g(ZjX)  -  g(Z„|X)/o(X) 

/  /(Zi  I Y)  - /(Z, I  y)  /o(y)  6y  /  ^(z,  I  x)  -  giz^  \  x)  f^(Y)  bY 


it  follows  that  we  can  use  g(Z  \  X)  as  though  it  were  the  actual  multitarget  likelihood  function.  Now  since  X  is 
assumed  to  be  very  large,  if  Z  is  any  observation-set  then  with  very  high  probability  Z-fxJ  is  a  large  sample  of 
data  drawn  from  the  uniform  clutter  distribution  c(z)  —  A'^.  Consequently,  the  quantity 

=  tIt  ^ 

\^\  i€2 

is  a  Parzen  kernel  estimator  of  the  uniform  distribution:  K(x)  »  c(x)  =  So, 


Define 


X  X 


where  is  the  geometric  mean  of  the  Poisson  distribution  and  <  X.  Thus  a  <  1 

Case  I:  Target  Not  in  the  Scene,  Suppose  that  a  target  is  not  present  in  the  scene.  Then  g0^  \  0)  -  1, 
g(ZP  \x)  ^  of  and  so  f(0  \  =  (l-Q)/(l-Q+Qof),  If  we  choose  the  prior  as  the  reference  standard,  then 

the  /oAf  constant  c  is  determined  by  the  equation  1-Q  —  c*Q  or  c  =  A(Q^-l),  Then  "no-target"  tests  are: 

0-^-7  >  of  (MaM),  1  >  of  (JoM) 


Case  II:  Target  Present  in  the  Scene,  with  High-Resolution  Sensor,  By  saying  that  the  sensor  resolution  is  very 
good  we  mean  that  AH/k  I  where  H  —  f(0).  In  this  case 


g(Z»^x) 


U,  ,1  f  |Z*| 


a*  + 

X 


Assume  that  f(x)  is  "Gaussian-like"  in  the  sense  that  f(0}^  =  If  and  f  f(x)'‘  dx  =  If'^/lf^.  That  is,  /ft/  has 
only  negligible  probability  outside  of  the  interval  +H''k''^.  Then 


/(X 1 2(^5)  "c 


(  l-Q  ifX=0 

|<2!d->(a*  +  A‘r*/(Xo-x))  if  X={;c} 


and  it  may  be  shown  that 

/  fix  I  Z<‘>)  dx  =  ("Tf  * 


and,  consequently,  that  the  "yes-target"  tests  for  the  MaM  and  JoM  estimators  are: 

HA 
X 

Therefore,  MaM  will  generally  converge  more  slowly  than  JoM,  However,  the  differences  will  be  most  pronounced 
when  there  is  significant  prior  belief  that  no  target  is  present  in  the  scene  (i.e.,  Q  ^  0), 

8.  Conclusions 

In  this  paper  we  have  demonstrated  that  when  one  tries  to  generalize  recursive  Bayesian  nonlinear  filtering  to  the 
multitarget  case  with  unknown  number  of  targets,  one  quickly  encounters  serious  troubles  that  directly  bear  on 


practice.  Specifically,  Bayes-optimal  multitarget  estimation  and  filtering  encounters  fundamental  conceptual  and 
practical  difficulties  when  the  number  of  targets  is  unknown.  What  is  at  issue  is  neither  theoretical  hair-splitting 
nor  mere  mathematical  "bookkeeping.”  Rather,  what  is  actually  at  stake  is  our  ability  to  do  Bayes-optimal 
multitarget  filtering  and  estimation  at  all  and,  moreover,  our  ability  to  even  know  what  **Bayes-optimar  means  in 
such  a  context.  We  explained  the  following  key  points  underlying  multitarget  recursive  Bayesian  nonlinear  filtering: 

•  Reconceptualize  a  group  of  unknown  targets,  of  unknown  number,  as  a  single  joint  multitarget  system 

•  Reconceptualize  target  number  and  individual  target  states  as  just  specific  parameters  of  a  single  joint 
multitarget  state 

•  Reconceptualize  priors,  posteriors,  and  likelihoods  as  single  densities  involving  an  unknown  number  of 
targets  and  target  states 

%  Reconceptualize  detection  and  estimation  as  different  aspects  of  a  single,  simultaneous  statistical  process 

•  Recognize  that  one  cannot  just  "declare  victory":  reconceptualization  is  the  easy  part,  because  it  leads  to 
unexpected  theoretical  and  practical  difficulties  that  cannot  be  swept  under  the  rug 

We  concretely  illustrated  the  meaning  of  these  points  by  contrasting  true  multitarget  recursive  Bayesian  nonlinear 
filtering  with  three  approaches  that  fail  to  address  these  difficulties:  the  EAMLE  and  generalized  EAMLE  filters, 
and  a  renaming  of  a  truncated  version  of  FISST  called  "joint  multitarget  probabilities  (JMP".  We  concluded  that 
the  JoM  multitarget  state  estimator  should  exhibit  better  performance  than  the  MaM  estimator. 
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Appendix:  "Generalized  EAMLE"  [17] 


To  illustrate  the  CMLE  method,  consider  the  problem  of  tracking  a  collection  of  )-dimensional  targets  moving  in  a  surveillance  volume  of  length  L.  The  number  of  targets  T  is  known. 
The  state  of  the  system  is  completely  specified  by  the  r-vector  \  =  (k,  From  a  Kalman  prediction  step  or  some  other  type  of  prior  estimation,  we  assume  Gaussian  a  priori  density 

is  available  for  x.  The  prior  is  P„0^)  =  N(K\\„X)  where  N0C;Y,a)  denotes  a  normal  distribution  in  X  with  mean  Y  and  covariance  matrix  a.. ..We  assume  that  the  target  positions 
are  a  scanned  by  sensor  [sicj  that  has  uniform  clutter  with  density  p.  The  known  detection  probability  1  is  the  same  for  all  of  the  targets.  For  the  valid  detections,  the  sensor  is  Gaussian 

with  variance  R.  A  scan  of  data  then  consists  of  the  set  Z  =  (z, . Zy}.  The  sensor  provides  no  information  about  which  of  the  4,  are  clutter  and  which  are  target-originated.  The  CMLE 

consists  of  (1)  computing  the  likelihood  P(Z  |  X)  and  then  2)  using  conventional  maximum  likelihood  estimation  (MLE)  (w  maximum  a  posteriori  estimation  (MAP))  to  estimate  X. 

X  -  orgmax  QhPqQC) +  hiPiZ\X))  (1) 

X 


To  evaluate  P(Z  \  X)  we  average  over  all  possible  assignments  of  the  M  measurements  in  the  scan  Z  to  the  T  tracks  X.... Observe  that  P(Z  j  X)  is  implicitly  conditioned  on  the  number 
of  targets  present  T,  but  we  suitress  that  fOT  now.  This  will  become  important  for  the  detection  problem,  however.  [Usii^  mean-field  methods  we  can  write] 


P(.Z\X)  =  /  (n  ftj 


(13) 


...This  integral  can  be  approximately  evaluated  using  the  saddle  point  method  common  throughout  statistical  physics  and  quantum  field  theory,.The  dominant  contribution  to  the  integral  given 
by  the  stationary  points  and  with  respect  to  and  integral  is  given  by 

P(Z|X)  =  c'  exp(£«^„.e,,z,.,x,)) 

where  c’  is  a  normalization  constant  that  depends  on  r....Ck>mbining  P(Z  \  X)  with  the  {HicMT  X  yields  the  log-likelihood  function  to  be  minimized: 

A  =  i(X-X,)’'S-‘(X-X,)  - 


To  form  the  mean  field  coherent  maximum  likelihood  position  estimate  (MFCMLE),we  must  simultaneously  minimize  A  with  respect  to  X,  17.  ,  and  c, 

2.3.3  MET  and  Detection.  We  now  present  our  approach  for  extendii^  the  MFCMLE  to  attack  the  problem  of  acoustic  detection  in  theater  ASW  using  the  problem  of  detecting  1-dimensional 
randmn  walkers  against  a  clutter  background  as  an  exan^le.  Suppose  that  we  have  K  scans  of  data  Z',...,25.  Each  scan  =»  ^  Wfjy  measurements  in  it  where  M(k) 

is  a  random  variable  that  depends  on  false  alarm  rate,  the  number  of  targets  present  and  their  detection  probabilities.  We  now  denote  the  full  data  set  Z  As  in  the  tracking  case 

studied  above,  we  assume  that  the  detection  probability  P^  fw  the  targets  and  the  false  alarm  density  p  arc  known.  To  estimate  the  number  of  tracks,  we  compute  P(Z  \  T)  where  T  is 
the  number  of  tracks.  The  MLE  estimate  for  T  is  then 


f  =  arg  m!aij.P(Z  \  7^. 


(20) 


In  principal,  once  an  estimate  for  T  is  generated,  the  location  of  the  tracks  is  an  independent  process.  However,  we  will  see  that  evaluation  of  P(Z  |  T)  will  involve  saddle  point  evaluation 
of  an  integral  and  that  this  saddle  point  will  yield  X.  Note  that  for  this  K  scan  problem,  X  is  now  a  T  x  X  matrix  with  elements  x/,  the  position  of  target  t  at  time  k.  We  define 
X*  =  to  be  the  vector  of  target  locations  at  time  k  (analogous  to  the  system  state  vectex  in  the  tracking  examples  above)  and  X,  =  ^r/,...,xA>  to  be  the  trajectory  of  asingle  target. 

To  compute  P(Z  \  T),  we  use  Bayes  rule  to  decon^xjse  the  probability  in  terms  of  the  T-track  state  densities  P(X  |  7), 

P(Z\T)  =  I  dXP(Z|X)P(Xl7^,  (21) 


where  dX  •  II*_/  in  Eq.  (21)  P(X  |  T)  represems  our  aprkxi  knowledge  about  target  behavior.  There  are  two  salient  features  that  characterize  Brownian  walkers.  First,  they 

arise  with  uniform  density  in  the  target  region  so  that  P(x,‘)  =  1/L,  t  =  T.  Second,  if  at  time  k  the  target  is  at  x*  then  the  density  for  its  location  at  time  *-»-/  is  normal  with  mean 
X,*  and  variance  Q,  Pix^*‘  \  x,*>  =  This  exhausts  our  a  priori  knowledge  of  the  walkers  so  we  have  P(X  \  7}  =  PI,./  PI,_A*'  Pix^*‘  j  x*».  The  other  factor  in  the 

integrand  of  Eq.  (21)  is  P(Z  \  X>  =  PI*. A  P0  \  X*)  where  P0  \  X*>  is  given  by  the  integral  Eqs.  (13,15).  Defining  dlf  =  Ut./ 11,./ df*  and  rft?  =  11*./  we  can  combine 

these  factors  to  express  P(Z  |  7}  as  the  integral 


P(Z|T)  -  dXdudZ  eacp 


^  f-1  t-1  *-l 


(22) 


where  is  given  by  Eq.  (15).  Now.  this  integral  can  also  be  solved  using  saddle  point  methods.  The  detection  algorithm  then  consists  of  computing  Eq.  (22)  for  various  values 

of  T  and  using  Eq.  (20)  to  pick  the  best  value  of  T.  Observe  that  the  saddle  point  evaluation  then  yields  estimates  for  the  tracks  X....One  subtle  issue  that  arises  here  is  the  evaluation  of 
the  normalization  C.  For  the  tracking  problem,  we  locate  the  stationary  point  of  the  integrand.  However  in  this  case,  we  must  compare  the  value  of  the  integral  for  different  values  of  T 
which  entails  at  least  af^roximate  evaluatitm  of  f  can  be  expressed  in  terms  of  the  determinant  of  a  matrix  whose  dimension  is  given  by  the  total  number  of  variables  in  the  problem, 
so  some  type  of  approximation  scheme  will  probably  be  required.  Also,  note  that  Eq,  (22)  must  be  evaluated  for  each  possible  value  of  T.  An  approach  to  this  problem  is  to  define  a  new 
function  using  the  approximation  to  the  Kronecker  (-function  of  Eq.  (8), 

At)  =  =  lim  (2ita)-W  f  dn  «-<'‘J'P(Z|r)  (23) 

T  ^  r-o 


P(t)  is  now  a  function  of  a  continuous  variable.  Many  of  the  terms  in  the  (22)  can  be  combined  and  it  is  probably  possible  to  evaluate  the  sum  over  T  in  (23)  in  closed  form.  This  has  the 
advantage  the  it  replaces  separate  Integrals  by  a  single  one. 


